Estimated estimating equations: Semiparametric inference for clustered/longitudinal data
نویسنده
چکیده
We introduce a flexible marginal modelling approach for statistical inference for clustered/longitudinal data under minimal assumptions. This estimated estimating equations (EEE) approach is semiparametric and the proposed models are fitted by quasi-likelihood regression, where the unknown marginal means are a function of the fixed-effects linear predictor with unknown smooth link, and variance-covariance is an unknown smooth function of the marginal means. We propose to estimate the nonparametric link and variance-covariance functions via smoothing methods, while the regression parameters are obtained via the estimated estimating equations. These are score equations that contain nonparametric function estimates. The proposed EEE approach is motivated by its flexibility and easy implementation. Moreover, if data follow a generalized linear mixed model (GLMM), with either specified or unspecified distribution of random effects and link function, the proposed model emerges as the corresponding marginal (population-average) version and can be used to obtain inference for the fixed effects in the underlying GLMM, without the need to specify any other components of this GLMM. Among marginal models, the EEE approach provides a flexible alternative to modelling with generalized estimating equations (GEE). Applications of EEE include diagnostics and link selection. The asymptotic distribution of the proposed estimators for the model parameters is derived, enabling statistical inference. Practical illustrations include Poisson modelling of repeated epileptic seizure counts and simulations for clustered binomial responses.
منابع مشابه
Semiparametric Regression for Clustered Data Using Generalized Estimating Equations
We consider estimation in a semiparametric generalized linear model for clustered data using estimating equations. Our results apply to the case where the number of observations per cluster is nite, whereas the number of clusters is large. The mean of the outcome variable Œ is of the form g4Œ5 D X‚C ˆ4T 5, where g4¢5 is a link function, X and T are covariates, ‚ is an unknown parameter vector...
متن کاملEfficient Estimation in Marginal Partially Linear Models for Longitudinal/Clustered Data Using Splines
We consider marginal semiparametric partially linear models for longitudinal/clustered data and propose an estimation procedure based on a spline approximation of the nonparametric part of the model and an extension of the parametric marginal generalized estimating equations (GEE). Our estimates of both parametric part and nonparametric part of the model have properties parallel to those of par...
متن کاملLocally efficient estimation of marginal treatment effects when outcomes are correlated: is the prize worth the chase?
Semiparametric methods have been developed to increase efficiency of inferences in randomized trials by incorporating baseline covariates. Locally efficient estimators of marginal treatment effects, which achieve minimum variance under an assumed model, are available for settings in which outcomes are independent. The value of the pursuit of locally efficient estimators in other settings, such ...
متن کاملA Semiparametric Marginalized Model for Longitudinal Data with Informative Dropout.
We propose a marginalized joint-modeling approach for marginal inference on the association between longitudinal responses and covariates when longitudinal measurements are subject to informative dropouts. The proposed model is motivated by the idea of linking longitudinal responses and dropout times by latent variables while focusing on marginal inferences. We develop a simple inference proced...
متن کاملEfficient semiparametric estimation in generalized partially linear additive models for longitudinal/clustered data
We consider efficient estimation of the Euclidean parameters in a generalized partially linear additive models for longitudinal/clustered data when multiple covariates need to be modeled nonparametrically, and propose an estimation procedure based on a spline approximation of the nonparametric part of the model and the generalized estimating equations (GEE). Although the model in consideration ...
متن کامل